
Xudong Lu

School of Biomedical Engineering and Instrumental Science, Zhejiang University, Hangzhou, P.R. China; School of Industrial Engineering, Eindhoven University of Technology, Eindhoven, The Netherlands

X-Stream: Exploring MLLMs as Multiplexers for Multi-Stream Understanding

Jun 01, 2026

OmniInteract: Benchmarking Real-World Streaming Interaction for Real-Time Omnimodal Assistants

May 26, 2026

Heterogeneous Multi-Agent Modeling for Measurement and Network Analysis of the Data Service Market

May 22, 2026

FLUID: From Ephemeral IDs to Multimodal Semantic Codes for Industrial-Scale Livestreaming Recommendation

May 20, 2026

AURA: Always-On Understanding and Real-Time Assistance via Video Streams

Apr 05, 2026

PhoStream: Benchmarking Real-World Streaming for Omnimodal Assistants in Mobile Scenarios

Jan 30, 2026

GLEAM: Learning to Match and Explain in Cross-View Geo-Localization

Sep 09, 2025

Adaptive Markup Language Generation for Contextually-Grounded Visual Document Understanding

May 08, 2025

GenieBlue: Integrating both Linguistic and Multimodal Capabilities for Large Language Models on Mobile Devices

Mar 08, 2025

SmartBench: Is Your LLM Truly a Good Chinese Smartphone Assistant?

Mar 08, 2025